STIC Biotechnology Systems Branch 



RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: /o/ ^^.DS^Q^ 

Source: ' 

Date Processed by STIC: //// /Ok 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 571-272-2510; FAX: 571-273-0221 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4.2.2 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker/chkrnote,htm 



Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 2023 1 zip code address for the 

United States Patent and Trademark OfiBce, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm> , EPS Submission 

User Manual - ePAVE) 

2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 223 13-1450 

3. Hand Carry, Federal Express, United Parcel Service, or other delivery service (EFFECTIVE 01/14/05): 
U.S. Patent and Trademark Ofifice, Mail Stop Sequence, Customer Window, Randolph Building, 401 Dulany Street, 
Alexandria, VA 22314 



Revised 01/24/05 




Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION SERIAL NUMBER: 




ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

1 Wrapped Nucleics The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 

Wrapped Aminos was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 
prevent "wrapping." 

2 Invalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 

The numbering under each 5*^ amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead. 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission is saved In ASCII text. 

Sequence(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present die maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentin version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequences(s) . Normally, Patentin would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 

Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence. 

<210> sequence id number 
<400> sequence id number 
000 



3 Misaligned Amino 

Numbering 

4 ^Non-ASCII 



5 ^Variable Length 



6 ^Patentin 2.0 

"bug" 



7 Skipped Sequences 

(OLD RULES) 



8 Skipped Sequences 

(NEW RULES) 
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Use of n's or Xaa's 
(NEW RULES) 



Invalid <213> 
Response 

Useof<220> 



Patentin 2.0 
■"bug" 



Use of n's and/or Xaa's have been detected in the Sequence Listing. 

P£r 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence 



Sequenced s'^ 



missing the <220> "Feature" and associated numeric identifiers andj 



Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or 

''Unknow n." Please explain source of genetic material in <220> to <223> section. 

(See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 



Please do not use "Copy to Disk" function of Patentin version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



13 



_ Misuse of n/Xaa "n" can only represent a single nucleotide: "Xaa" can only represent a single amino acid 



AMC - Biotechnology Systems Branch - 09/09/2003 



RAW SEQUENCE LISTING DATE: 01/11/2006 

PATENT APPLICATION: US/10/527,257 TIME: 14:12:07 



0 



Input Set : A:\186353 US - Sequence Listing 10 527257.txt 
Output Set: N:\CRP4\01112006\J527257.raw 

3 <110> APPLICANT: Wu, Jun 

4 Luo, Ying 

6 <120> TITLE OF INVENTION: TUMOR TAG AND THE USE THEREOF 

8 <130> FILE REFERENCE: 024455 
C--> 10 <140> CURRENT APPLICATION NUMBER: US/10/527,257 
C--> 10 <141> CURRENT FILING DATE: 2005-03-09 

10 <160> NUMBER OF SEQ ID NOS : 9 

12 <170> SOFTWARE: Patentin version 3.1 

14 <210> SEQ ID NO: 1 

15 <211> LENGTH: 720 

16 <212> TYPE: DNA tflOlV 

17 <213> ORGANISM: Homo sapiens DO©* ^^'^ «e N©©*''®*'' 

19 <220> FEATURE: l!v>rreCted O^*"^® 

20 <221> NAME/KEY: CDS COT* 

21 <222> LOCATION: (1) . . (639) 

22 <223> OTHER INFORMATION: 
W--> 25 <400> 1 

26 atg gca gcg gcc gcc age ccc gcg ttc ctt eta cgc etc ccg ctt etg 48 

27 Met Ala Ala Ala Ala Ser Pro Ala Phe Leu Leu Arg Leu Pro Leu Leu 

28 1 5 10 15 

30 etc ctg etg tec age tgg tgc agg ace ggg etg gcc gac ect cac tct 96 

31 Leu Leu Leu Ser Ser Trp Cys Arg Thr Gly Leu Ala Asp Pro His Ser 

32 20 25 30 

34 Ctt tgc tat gac ate acc gtc ate ect aag ttc aga ect gga cca egg 144 

35 Leu Cys Tyr Asp lie Thr Val lie Pro Lys Phe Arg Pro Gly Pro Arg 

36 35 40 45 

38 tgg tgt gcg gtt caa gge cag gtg gat gaa aag act ttt ctt cac tat 192 

39 Trp Cys Ala Val Gin Gly Gin Val Asp Glu Lys Thr Phe Leu His Tyr 

40 50 55 60 

42 gac tgt ggc age aag aca gtc aca ccc gtc agt ccc etg ggg aag aaa 240 

43 Asp Cys Gly Ser Lys Thr Val Thr Pro Val Ser Pro Leu Gly Lys Lys 

44 65 70 75 80 

46 eta aat gtc aca aeg gcc tgg aaa gca eag aac cca gta ctg aga gag 288 

47 Leu JVsn Val Thr Thr Ala Trp Lys Ala Gin Asn Pro Val Leu Arg Glu 

48 85 90 95 

50 gtg gtg gac ata ctt aca gag caa ctg ctt gac att cag ctg gag aat 336 

51 Val Val Asp lie Leu Thr Glu Gin Leu Leu Asp lie Gin Leu Glu Asn 

52 100 105 110 

54 tac ata ccc aag gaa ccc etc ace ctg cag gcc agg atg tct tgt gag 384 

55 Tyr lie Pro Lys Glu Pro Leu Thr Leu Gin Ala Arg Met Ser Cys Glu 

56 115 120 125 

58 eag aaa gee gaa gga cac ggc agt gga tct tgg cag etc agt ttc gat 432 

59 Gin Lys Ala Glu Gly His Gly Ser Gly Ser Trp Gin Leu Ser Phe Asp 
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RAW SEQUENCE LISTING DATE: 01/11/2006 

PATENT APPLICATION: US/10/527 , 257 TIME: 14:12:07 

Input Set : A:\186353 US - Sequence Listing 10 527257 .txt 
Output Set: N:\CRF4\01112006\J527257.raw 

60 130 135 140 

62 gga cag ate ttc etc etc ttt gac tea gaa aae aga atg tgg aca acg 480 

63 Gly Gin lie Phe Leu Leu Phe Asp Ser Glu Asn Arg Met Trp Thr Thr 

64 145 150 155 160 

66 gtt cat eet gga gee aga aag atg aaa gaa aag tgg gag aat gac aag 528 

67 Val His Pro Gly Ala Arg Lys Met Lys Glu Lys Trp Glu Asn Asp Lys 

68 165 170 175 

70 gat atg ace atg tec ttc cat tac ate tea atg gga gac tge aea gga 576 

71 Asp Met Thr Met Ser Phe His Tyr lie Ser Met Gly Asp Cys Thr Gly 

72 180 185 190 

74 tgg ctt gag gac ttc ttg atg gge atg gac age ace ctg gag eca agt 624 

75 Trp Leu Glu Asp Phe Leu Met Gly Met Asp Ser Thr Leu Glu Pro Ser 

76 195 200 205 

78 gca gga gge aca gtc tgacccaaag ccatggecac eaeeetcagt ccctgcagcc 679 

79 Ala Gly Gly Thr Val 

80 210 

82 tcctccteat cetcccctge ttcatcctec etggeatctg a 720 

85 <210> SEQ ID NO: 2 

86 <211> LENGTH: 213 

87 <212> TYPE: PRT 

88 <213> ORGANISM: Homo sapiens 
90 <400> SEQUENCE: 2 

92 Met Ala Ala Ala Ala Ser Pro Ala Phe Leu Leu Arg Leu Pro Leu Leu 

93 1 5 10 15 

96 Leu Leu Leu Ser Ser Trp Cys Arg Thr Gly Leu Ala Asp Pro His Ser 

97 20 25 30 

100 Leu Cys Tyr Asp lie Thr Val lie Pro Lys Phe Arg Pro Gly Pro Arg 

101 35 40 45 

104 Trp Cys Ala Val Gin Gly Gin Val Asp Glu Lys Thr Phe Leu His Tyr 

105 50 55 60 

108 Asp Cys Gly Ser Lys Thr Val Thr Pro Val Ser Pro Leu Gly Lys Lys 

109 65 70 75 80 

112 Leu Asn Val Thr Thr Ala Trp Lys Ala Gin Asn Pro Val Leu Arg Glu 

113 85 90 95 

116 Val Val Asp lie Leu Thr Glu Gin Leu Leu Asp lie Gin Leu Glu Asn 

117 100 105 110 

120 Tyr lie Pro Lys Glu Pro Leu Thr Leu Gin Ala Arg Met Ser Cys Glu 

121 115 120 125 

124 Gin Lys Ala Glu Gly His Gly Ser Gly Ser Trp Gin Leu Ser Phe Asp 

125 130 135 140 

128 Gly Gin lie Phe Leu Leu Phe Asp Ser Glu Asn Arg Met Trp Thr Thr 

129 145 150 155 160 

132 Val His Pro Gly Ala Arg Lys Met Lys Glu Lys Trp Glu Asn Asp Lys 

133 165 170 175 

136 Asp Met Thr Met Ser Phe His Tyr lie Ser Met Gly Asp Cys Thr Gly 

137 180 185 190 

140 Trp Leu Glu Asp Phe Leu Met Gly Met Asp Ser Thr Leu Glu Pro Ser 

141 195 200 205 
144 Ala Gly Gly Thr Val 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/527,257 



DATE: 01/11/2006 
TIME: 14:12:07 



Input Set : A:\186353 US - Sequence Listing 10 527257.txt 
Output Set: N:\CRP4\01112006\J527257.raw 



145 210 

148 <210> SEQ ID NO: 3 

149 <211> LENGTH: 29 

150 <212> TYPE: DNA ^^^-^ — 

151 <213> ORGANISM :(j>^i"^er 




153 <400> SEQUENCE: T"- ^ 

154 cggaattcat ggcagcggcc gccagcccc 

157 <210> SEQ ID NO: 4 

158 <211> LENGTH: 30 

159 <212> TYPE: DNA ^ 



160 <213> ORGANISM ^^irner; 

162 <400> SEQUENCE: 4 

163 gccaagcttg atgccaggga ggatgaagca 

166 <210> SEQ ID NO: 5 

167 <211> LENGTH: 34 

168 <212> TYPE: DNA 



171 <400> SEQUENCE:^ 

172 ccggaattcg accctcactc tctttgctat gaca 

175 <210> SEQ ID NO: 6 

176 <211> LENGTH: 30 

177 <212> TYPE: ^NA^_^_^^^ 

178 <213> ORGANI SM( ^i^mer ^ 

180 <400> SEQUENCEr"S 

181 gccaagcttg atgccaggga ggatgaagca 

184 <210> SEQ ID NO: 7 

185 <211> LENGTH: 21 

186 <212> TYPE: DNA r^^^T ^ 

187 <213> ORGANISM <|^ prlme]^ 

189 <400> SEQUENCE: 7 

190 atggcagcgg ccgccagccc c 

193 <210> SEQ ID NO: 8 

194 <211> LENGTH: 24 

195 <212> TYPE: DNA v 



198 <400> SEQUENCE: 8 

199 tcagatgcca gggaggatga agca 

202 <210> SEQ ID NO: 9 

203 <211> LENGTH: 742 
2 04 <212> TYPE: DNA 

205 <213> ORGANISM: Homo sapiens 

207 <400> SEQUENCE: 9 

2 08 atggcagcgg ccgccagccc cgcgttcctt ctacgcctcc cgcttctgct cctgctgtcc 

210 agctggtgca ggaccgggct ggccgaccct cactctcttt gctatgacat caccgtcatc 

212 cctaagttca gacctggacc acggtggtgt gcggttcaag gccaggtgga tgaaaagact 

214 tttcttcact atgactgtgg cagcaagaca gtcacacccg tcagtcccct ggggaagaaa 

216 ctaaatgtca caacggcctg gaaagcacag aacccagtac tgagagaggt ggtggacata 

218 cttacagagc aactgcttga cattcagctg gagaattaca tacccaagga acccctcacc 

220 ctgcaggcca ggatgtcttg tgagcagaaa gccgaaggac acggcagtgg atcttggcag 



30 



34 



30 



21 



24 



60 
120 
180 
240 
300 
360 
420 
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RAW SEQUENCE LISTING DATE: 01/11/2006 

PATENT APPLICATION: US/10/527,257 TIME: 14:12:07 

Input Set : A:\186353 US - Sequence Listing 10 527257.txt 
Output Set: N: \CRF4\01112006\J527257 • raw 



222 ctcagtttcg atggacagat cttcctcctc 

224 gttcatcctg gagccagaaa gatgaaagaa 

226 tccttccatt acatctcaat gggagactgc 

228 atggacagca ccctggagcc aagtgcagga 

230 caacccaggg ccacggccac caccctcatc 

232 atatgctcca ggcacagtct ga 



tttgactcag aaaacagaat gtggacaacg 480 
aagtgggaga atgacaagga tatgaccatg 540 
acaggatggc ttgaggactt cttgatgggc 600 
gcaccaccca ccatgtcctc aggcacagcc 660 
ctttgctgcc tcctcatcat gtgtctcctc 720 

742 
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VERIFICATION SUMMARY DATE: 01/11/2006 

PATENT APPLICATION: US/10/527,257 TIME: 14:12:08 

Input Set : A:\186353 US - Sequence Listing 10 527257.txt 
Output Set: N:\CRF4\01112006\J527257.raw 

L:10 M:270 C: Current Application Number differs, Replaced Current Application No 
L:10 M:271 C: Current Filing Date differs. Replaced Current Filing Date 
L:25 M:258 W: Mandatory Feature missing, <223> Blank for SEQ# : 1, Line# : 22 
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